Modified Constraint Scores for Semi-Supervised Feature Selection
نویسندگان
چکیده
Semi-supervised constraint scores, which utilize both pairwise constraints and the local property of the unlabeled data to select features, achieve comparable performance to the supervised feature selection methods. The local property is characterized without considering the pairwise constraints and these two conditions are introduced independently. However, the pairwise constraints and the local property may contain conflicting information. In this paper, we utilize the conflicting information to improve the local property. Instead of characterizing the local property by all neighbors, samples which do not appear in the cannot-link constraints can be used. A performance indicator, called neighborhood-cannotlink (NC) coefficient, is proposed to measure the improvement of the local property. We use the improved local property and the pairwise constraints to perform semi-supervised constraint scores algorithm. Experiments on several real world data sets demonstrate the effectiveness of the methods.
منابع مشابه
Constraint scores for semi-supervised feature selection: A comparative study
HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau...
متن کاملSemi-Supervised Feature Selection with Constraint Sets
In machine learning classification and recognition are crucial tasks. Any object is recognized with the help of features associated with it. Among many features only some leads to classify object correctly. Feature selection is useful technique to detect such specific features. Feature selection is a process of selecting subset of features to reduce number of features (dimensionality reduction)...
متن کاملWised Semi-Supervised Cluster Ensemble Selection: A New Framework for Selecting and Combing Multiple Partitions Based on Prior knowledge
The Wisdom of Crowds, an innovative theory described in social science, claims that the aggregate decisions made by a group will often be better than those of its individual members if the four fundamental criteria of this theory are satisfied. This theory used for in clustering problems. Previous researches showed that this theory can significantly increase the stability and performance of...
متن کاملWised Semi-Supervised Cluster Ensemble Selection: A New Framework for Selecting and Combing Multiple Partitions Based on Prior knowledge
The Wisdom of Crowds, an innovative theory described in social science, claims that the aggregate decisions made by a group will often be better than those of its individual members if the four fundamental criteria of this theory are satisfied. This theory used for in clustering problems. Previous researches showed that this theory can significantly increase the stability and performance of...
متن کاملSemi-Supervised Spectral Mapping for Enhancing Separation between Classes
We present a spectral mapping technique for semisupervised pattern classification. Importance scores of features are firstly evaluated with a semi-supervised feature selection algorithm by Zhao et al. Training data are then embedded into a low-dimensional space with a spectral mapping derived from the selected and weighted feature vectors with which test data are classified by the nearest neigh...
متن کامل